Abstract. Let $\mathcal{L}$ be any number field or $p$-adic field and consider $F := (f_1, \ldots, f_k)$ where
$f_i \in \mathcal{L}[x_1^{\pm 1}, \ldots, x_n^{\pm 1}] \setminus \{0\}$ for all $i$ and there are exactly $m$ distinct exponent vectors
appearing in $f_1, \ldots, f_k$. We prove that $F$ has no more than $1 + (\sigma m n (m-1)^2 \log m)^n$ geometrically
isolated roots in $\mathcal{L}^n$, where $\sigma$ is an explicit and effectively computable constant depending
only on $\mathcal{L}$. This gives a significantly sharper arithmetic analogue of Khovanski's Theorem
on Fewnomials and a higher-dimensional generalization of an earlier result of Hendrik W.
Lenstra, Jr. for the case of a single univariate polynomial. We also present some further
refinements of our new bounds and briefly discuss the complexity of finding isolated rational roots.



1. Introduction 

A consequence of Descartes' Rule (a classic result dating back to 1637) is that any real 
univariate polynomial with exactly $m \ge 1$ monomial terms has at most $2m - 1$ real roots.
This has since been generalized by Askold G. Khovanski during 1979-1987 (see [Kho80] and
[Kho91, pg. 123]) to certain systems of multivariate sparse polynomials and even
fewnomials.^1 Here we provide ultrametric and thereby arithmetic analogues for both results: we
give explicit upper bounds, independent of the degrees of the underlying polynomials, for 
the number of isolated roots of sparse polynomial systems over any p-adic field and, as a 
consequence, over any number field. For convenience, let us henceforth respectively refer 
to these cases as the local case and the global case. 
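
As a concrete illustration of the classical real bound recalled above, here is a minimal Python sketch of the sign-change count behind Descartes' Rule. It is not taken from the paper: the function names and the dictionary encoding (nonnegative integer exponent mapped to nonzero real coefficient) are assumptions made for this example only.

```python
def sign_changes(coeffs):
    """Count sign changes in a sequence of nonzero real numbers."""
    signs = [c > 0 for c in coeffs]
    return sum(1 for s, t in zip(signs, signs[1:]) if s != t)


def descartes_bound(terms):
    """Upper bound on the number of real roots of a sparse polynomial.

    `terms` maps nonnegative integer exponents to nonzero real coefficients
    (a hypothetical encoding chosen for this sketch).
    """
    exps = sorted(terms)
    # Sign changes of f(x) bound the number of positive roots.
    positive = sign_changes([terms[a] for a in exps])
    # Sign changes of f(-x) bound the number of negative roots.
    negative = sign_changes([terms[a] * (-1) ** a for a in exps])
    # x = 0 is a root exactly when there is no constant term.
    zero = 1 if exps[0] > 0 else 0
    return positive + negative + zero


# x^1000 - 1 has m = 2 terms; the count never exceeds 2m - 1 = 3,
# no matter how large the degree is.
print(descartes_bound({0: -1, 1000: 1}))
```

With $m$ terms each of the two sign-change counts is at most $m-1$, which is where the degree-independent bound $2m-1$ comes from.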

Suppose $f_1, \ldots, f_k \in \mathcal{L}[x_1^{\pm 1}, \ldots, x_n^{\pm 1}] \setminus \{0\}$ where $\mathcal{L}$ is a field to be specified later, and
$m$ is the total number of distinct exponent vectors appearing in $f_1, \ldots, f_k$ (assuming all
polynomials are written as sums of monomials). We call $F := (f_1, \ldots, f_k)$ an $m$-sparse
$k \times n$ polynomial system over $\mathcal{L}$. Khovanski's results take $\mathcal{L} = \mathbb{R}$ and yield an explicit
upper bound for the number of non-degenerate roots, in the non-negative orthant, of any
$m$-sparse $n \times n$ polynomial system [Kho80, Kho91]. With a little extra work (e.g., [Roj00a,
cor. 3.2]) his results imply an upper bound of $2^{O(n)} n^{O(m)} 2^{O(m^2)}$ on the number of isolated^2
roots of $F$ in $\mathbb{R}^n$, and this is asymptotically the best general upper bound currently known.
In particular, since it is easy to show that the last bound can in fact be replaced by 1 when
$m \le n$ (see, e.g., [LRW01, thm. 3, part (b)]), one should focus on better understanding the
behavior of the maximum number of isolated real roots for $n$ fixed and $m \ge n + 1$. For
example, is the dependence on $m$ in fact polynomial for fixed $n$? This turns out to be an



^1 Sparse polynomials are sometimes also known as lacunary polynomials and, over $\mathbb{R}$, are a special case of
fewnomials, a more general class of real analytic functions [Kho91].
^2 We say a root of $F$ is geometrically isolated iff it is a zero-dimensional component of the underlying
scheme over the algebraic closure of $\mathcal{L}$ defined by $F$. For the case of $\mathcal{L} = \mathbb{R}$ one can in fact use the slightly
looser definition that a point is topologically isolated iff it is a connected component of the underlying real
zero set. Unless otherwise mentioned, all our isolated roots will be geometrically isolated.
open question, but we can answer the arithmetic analogue (i.e., where $\mathcal{L}$ is any $p$-adic field
or any number field) affirmatively and explicitly:

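Theorem 1 Let $p$ be any (rational) prime and $d, \delta$ positive integers. Suppose $\mathcal{L}$ is any
degree $d$ algebraic extension of $\mathbb{Q}_p$ or $\mathbb{Q}$, and let $\mathcal{L}^* := \mathcal{L} \setminus \{0\}$. Also let $F$ be any $m$-sparse
$k \times n$ polynomial system over $\mathcal{L}$ and define $B(\mathcal{L}, m, n)$ to be the maximum number of isolated
roots in $(\mathcal{L}^*)^n$ of such an $F$ in the local case, counting multiplicities if (and only if) $k = n$.
Then $B(\mathcal{L}, m, n) = 0$ (if $m \le n$ or $k < n$) and

\[ B(\mathcal{L}, m, n) \le u(m, n) \left\{ c(m-1)n(p^d - 1) \left[ 1 + d \log_p\left( \frac{d(m-1)}{\log p} \right) \right] \right\}^n \quad \text{(if } m \ge n+1 \text{ and } k \ge n\text{)}, \]

where $u(m, n)$ is $m - 1$, $4(m-1)^2$, or $(m(m-1)/2)^n$ according as $n = 1$, $n = 2$, or $n \ge 3$;
$c := \frac{e}{e-1} \le 1.582$ and $\log_p(\cdot)$ denotes the base $p$ logarithm function. Furthermore, moving to
the global case, let us say a root $x \in \mathbb{C}^n$ of $F$ is of degree $\le \delta$ over $\mathcal{L}$ iff every coordinate of
$x$ lies in an extension of degree $\le \delta$ of $\mathcal{L}$, and let us define $A(\mathcal{L}, \delta, m, n)$ to be the maximum
number of isolated roots of $F$ in $(\mathbb{C}^*)^n$ of degree $\le \delta$ over $\mathcal{L}$, counting multiplicities^3 if (and
only if) $k = n$. Then $A(\mathcal{L}, \delta, m, n) = 0$ (if $m \le n$ or $k < n$) and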

A{C,6,m,n)<2u{m,n) ^^c{m - l)n2'^^ 1 + 2d'^d'^ log2 (^ '^ '^ii™2~^^ ) } (if m>n + I and k<n). 

Our bounds can be sharpened even further, and this is detailed in corollaries 1 and 2 of
sections 2 and 3, respectively.
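
The local bound of theorem 1 is easy to evaluate numerically. The following Python sketch is not the author's Maple code: the function names `u` and `local_bound` are hypothetical, and the formula coded here is the local-case bound as reconstructed in this text, namely $u(m,n)\{c(m-1)n(p^d-1)[1+d\log_p(d(m-1)/\log p)]\}^n$. The global bound on $A(\mathcal{L},\delta,m,n)$ is deliberately left out.

```python
import math

C = math.e / (math.e - 1)  # the constant c = e/(e-1) of theorem 1


def u(m, n):
    """The factor u(m, n) of theorem 1."""
    if n == 1:
        return m - 1
    if n == 2:
        return 4 * (m - 1) ** 2
    return (m * (m - 1) // 2) ** n


def local_bound(p, d, m, n):
    """Upper bound on B(L, m, n) for a degree d extension L of Q_p (case k >= n)."""
    if m <= n:
        return 0.0
    # 1 + d * log_p(d (m - 1) / log p), with log the natural logarithm
    bracket = 1 + d * math.log(d * (m - 1) / math.log(p), p)
    return u(m, n) * (C * (m - 1) * n * (p ** d - 1) * bracket) ** n


# A 5-sparse 2 x 2 system over Q_2 (p = 2, d = 1).
print(math.floor(local_bound(p=2, d=1, m=5, n=2)))
```

Under these assumptions the printed integer part is 127645, which agrees with the figure quoted for the 5-sparse case in example 1.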

Remark 1 At the expense of underestimating some multiplicities,^4 we can easily obtain
a bound for the number of isolated roots of $F$ in $\mathcal{L}^n$ (in the local case) or roots in $\mathbb{C}^n$ of
degree $\le \delta$ over $\mathcal{L}$ (in the global case): By simply setting all possible subsets of variables to



^3 The multiplicity of any isolated root here, which we take in the sense of intersection theory for a scheme
over the algebraic closure of $\mathcal{L}$ [Ful98], turns out to always be a positive integer (see, e.g., [Smi97, Roj99]).
^4 e.g., roots on the coordinate hyperplanes may have multiplicities $>1$ counted as 1 instead.
zero, we easily obtain respective bounds of
$1 + \sum_{j=1}^{n} \binom{n}{j} B(\mathcal{L}, m, j) \le 1 + 2^n B(\mathcal{L}, m, n)$ and
$1 + \sum_{j=1}^{n} \binom{n}{j} A(\mathcal{L}, \delta, m, j) \le 1 + 2^n A(\mathcal{L}, \delta, m, n)$. Of course, since many of the monomial
terms of $F$ will vanish upon setting an $x_i$ to 0, these bounds will usually be larger than really
necessary. $\diamond$



Example 1 Consider the following $2 \times 2$ system over $\mathbb{Q}_2$:



12,1 ^0-2,2 



+ 



which is $m$-sparse for some $m \le \mu + 2$. Theorem 1 and an elementary calculation then tell
us that such an $F$ has no more than

\[ 41(\mu + 1)^4 \left( 1 + \log_2 \frac{\mu + 1}{0.693} \right)^2 \]

isolated roots, counting multiplicities, in $(\mathbb{Q}_2^*)^2$ (and $(\mathbb{Q}^*)^2$ as well, via the natural embedding
$\mathbb{Q} \hookrightarrow \mathbb{Q}_2$). For instance, $\mu = 3 \Longrightarrow F$ is at worst 5-sparse and has no more than 127645
roots in $(\mathbb{Q}_2^*)^2$. Explicit bounds independent of the total degrees of $f_1$ and $f_2$ appear to have
been unknown before. However, if we replace $\mathbb{Q}_2$ by $\mathbb{R}$ throughout, then the best previous
upper bounds were $4(2^{\mu} - 2)$ for all $\mu \ge 1$ and a bound of 20 in the special case $\mu = 3$.



Interestingly, the latter bounds, which follow easily from [LRW01, thm. 1], in fact allow us
to take real exponents and count topologically isolated roots, but without multiplicities.^6

^5 The numerical calculations throughout this paper were done with the assistance of Maple, and the code
for these calculations is available from the author's web-page.
^6 Khovanski's Theorem on Fewnomials [Kho91, cor. 7, sec. 3.12], which only counts roots with non-singular
Jacobian, implies an upper bound of 995328 for $\mu = 3$.

The real analytic upper bound exceeds our arithmetic bound for all $\mu \ge 29$, where both bounds
begin to exceed 1.3 billion. A sharper bound, based on a refinement of theorem 1 (cf. corollary
1), appears in example 3 of section 2. $\diamond$



Example 2 Another consequence of theorem 1 is that for fixed $\mathcal{L}$, we now know $\log B(\mathcal{L}, m, n)$
and $\log A(\mathcal{L}, \delta, m, n)$ to within a constant factor: For $m \ge 2$ consider the $m$-sparse $n \times n$
polynomial system $F = (f_1, \ldots, f_n)$ where $f_i = \prod_{j=1}^{m-1} (x_i - j)$ for all $i$. Clearly then, this $F$ has
exactly $(m-1)^n$ isolated roots in $\mathbb{N}^n$. It is curious that the analogous growth-rate is unknown
if $\mathcal{L}$ is replaced by the usual Archimedean completion $\mathbb{R}$ of $\mathbb{Q}$. $\diamond$



A weaker version of theorem 1 with non-explicit bounds was derived earlier in [Roj01b].
In particular, explicit bounds were known previously only in the case $k = n = 1$ [Len99b, thm.
1 and thm. 2], and all our bounds (save the global case) match the bounds of [Len99b] in
this special case.^7 Philosophically, the approach of [Len99b] was more algebraic (low degree
factors of polynomials) while our point of view here is more geometric (isolated rational
points of low degree in a hypersurface intersection).^8 The only other results known for $k > 1$
or $n > 1$ were derived via rigid analytic geometry and model theory, and in our notation
yield a non-effective bound of $B(\mathbb{Q}_p, m, n) < \infty$ (see the seminal works [DV88, Lip88]).



^7 In order to streamline the proof of our number field generalization, we left our bound on $A(\mathcal{L}, \delta, m, n)$

in theorem |l| a bit loose: for n = l our bound reduces to O (d^5'^m^2''* log(d5m)), while the older univariate 

result yields O {d5m^2'^^ log(d5m)) in our notation. A sharper bound, agreeing with Lenstra's univariate 

bound when $n = 1$, appears in corollary 2 of section 3.
^8 Lenstra has also considered a higher-dimensional generalization but in a different direction: bounds for
the number of rational hyperplanes in a hypersurface defined by a single $m$-sparse $n$-variate polynomial
[Len99b, prop. 6.1].
Our approach is simpler and is based on a higher-dimensional generalization (theorem 2
of the next section) of a result of Hendrik W. Lenstra, Jr. for univariate sparse polynomials
over certain algebraically closed fields [Len99b, thm. 3]. Indeed, aside from the introduction
of some higher-dimensional convex geometry, our proof of theorem 1 is structurally quite
similar to Lenstra's proof of the $k = n = 1$ case in [Len99b]: reduce the global case to the
local case, then reduce the local case to a refined result over the $p$-adic complex numbers.

We now describe two results used in our proofs which may be of broader interest. We
also point out that connections between our results and complexity theory, including the
question of whether we can find isolated rational roots in polynomial time, are described in
section 5.



1.1. The Distribution of p-adic Complex Roots

For any (rational) prime $p$, let $\mathbb{C}_p$ denote the completion (with respect to the $p$-adic metric)
of the algebraic closure of $\mathbb{Q}_p$. Theorem 1 follows from a careful application of two results
on the distribution of roots of $F$ in $(\mathbb{C}_p^*)^n$. The first result strongly limits the number of
roots that can be $p$-adically close to the point $(1, \ldots, 1)$. The second result strongly limits
the number of distinct valuation vectors which can occur for the roots of $F$.

Theorem 2 Let $F$ be any $m$-sparse $k \times n$ polynomial system over $\mathbb{C}_p$. Also let $r_1, \ldots, r_n > 0$,
$r := (r_1, \ldots, r_n)$, and let $\mathrm{ord}_p : \mathbb{C}_p \longrightarrow \mathbb{Q} \cup \{+\infty\}$ denote the usual exponential valuation,
normalized^9 so that $\mathrm{ord}_p\, p = 1$. Finally, let $C_p(m, n, r)$ denote the maximum number of
isolated roots $(x_1, \ldots, x_n)$ of $F$ in $\mathbb{C}_p^n$ with $\mathrm{ord}_p(x_i - 1) \ge r_i$ for all $i$, counting multiplicities
^ So, for example, ordpO = +oo and aiAp(j)^r) — k whenever r is a unit in and fcGQ. 
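if (and only if) $k = n$. Then $C_p(m, n, r) = 0$ (if $m \le n$ or $k < n$) and

\[ C_p(m, n, r) \le \frac{\left\{ c(m-1) \left[ r_1 + \cdots + r_n + \log_p\left( \frac{(m-1)^n}{r_1 \cdots r_n \log^n p} \right) \right] \right\}^n}{\prod_{i=1}^{n} r_i} \]

(if $m \ge n+1$ and $k \ge n$), where $c := \frac{e}{e-1} \le 1.582$. Furthermore, if we restrict to those $F$ where
$k = n$ and $f_i$ has exactly $m_i$ monomial terms for all $i$, then we have the sharper bounds of
$C_p(m, n, r) = 0$ (if $m_i \le 1$ for some $i$) and

\[ C_p(m, n, r) \le c^n \prod_{i=1}^{n} \left\{ \frac{(m_i - 1) \left[ r_1 + \cdots + r_n + \log_p\left( \frac{(m_i-1)^n}{r_1 \cdots r_n \log^n p} \right) \right]}{r_i} \right\} \]

(if $m_1, \ldots, m_n \ge 2$).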
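
The two bounds of theorem 2 can be evaluated directly. The sketch below is illustrative only: `cp_bound` and `cp_bound_refined` are hypothetical names, and the formulas are the ones displayed above as reconstructed in this text (in particular the division by $r_1 \cdots r_n$, respectively by each $r_i$).

```python
import math

C = math.e / (math.e - 1)  # c = e/(e-1)


def cp_bound(p, m, r):
    """First bound of theorem 2 on C_p(m, n, r), with r = (r_1, ..., r_n)."""
    n = len(r)
    if m <= n:
        return 0.0
    prod_r = math.prod(r)
    # log_p( (m-1)^n / (r_1 ... r_n log^n p) )
    log_term = math.log((m - 1) ** n / (prod_r * math.log(p) ** n), p)
    return (C * (m - 1) * (sum(r) + log_term)) ** n / prod_r


def cp_bound_refined(p, ms, r):
    """Sharper bound when k = n and f_i has exactly ms[i] monomial terms."""
    n = len(r)
    if any(mi <= 1 for mi in ms):
        return 0.0
    prod_r = math.prod(r)
    total = C ** n
    for mi, ri in zip(ms, r):
        log_term = math.log((mi - 1) ** n / (prod_r * math.log(p) ** n), p)
        total *= (mi - 1) * (sum(r) + log_term) / ri
    return total


# Univariate trinomials over C_2, roots with ord_2(x - 1) >= 1.
print(cp_bound(2, 3, [1.0]))
```

For $n = 1$ the first bound collapses to $c(m-1)\left[1 + \frac{1}{r}\log_p\frac{m-1}{r \log p}\right]$, which is a convenient sanity check on the reconstruction.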



These bounds appear to be new: the only previous results in this direction appear to
have been Lenstra's derivation of the special case $n = 1$ [Len99b, thm. 3] and an earlier
observation of Leonard Lipshitz [Lip88, thm. 2] equivalent to the non-explicit bound
$C_p(m, n, (1, \ldots, 1)) < \infty$.

Our last bound over $\mathbb{C}_p$ is based on a toric arithmetic-geometric result of Smirnov, stated
below.



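Definition 1 For any $a = (a_1, \ldots, a_n) \in \mathbb{Z}^n$, let $x^a := x_1^{a_1} \cdots x_n^{a_n}$. Writing any $f \in \mathcal{L}[x_1^{\pm 1}, \ldots, x_n^{\pm 1}]$
as $\sum_{a \in \mathbb{Z}^n} c_a x^a$, we call $\mathrm{Supp}(f) := \{a \mid c_a \ne 0\}$ the support of $f$. Also, let $\pi : \mathbb{R}^{n+1} \longrightarrow \mathbb{R}^n$
be the natural projection forgetting the $x_{n+1}$ coordinate and, for any $n$-tuple of polytopes
$P = (P_1, \ldots, P_n)$, define $\pi(P) := (\pi(P_1), \ldots, \pi(P_n))$. $\diamond$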



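Definition 2 For any $k \times n$ polynomial system $F$ over $\mathbb{C}_p$, its $k$-tuple of $p$-adic Newton
polytopes, $\mathrm{Newt}_p(F) = (\mathrm{Newt}_p(f_1), \ldots, \mathrm{Newt}_p(f_k))$, is defined as follows: $\mathrm{Newt}_p(f_i) :=
\mathrm{Conv}(\{(a, \mathrm{ord}_p c_a) \mid a \in \mathrm{Supp}(f_i)\}) \subset \mathbb{R}^{n+1}$, where $\mathrm{Conv}(S)$ denotes the convex hull^10 of a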
^10 i.e., smallest convex set containing...



Arithmetic Multivariate Descartes' Rule 9 

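set $S \subseteq \mathbb{R}^{n+1}$. Also, for any $w \in \mathbb{R}^{n+1}$ and any closed subset $B \subset \mathbb{R}^{n+1}$, let the face of $B$ with
inner normal $w$, $B^w$, be the set of points $x \in B$ which minimize the inner product $w \cdot x$.
Finally, let $\mathrm{Newt}_p^w(F) := (\mathrm{Newt}_p^w(f_1), \ldots, \mathrm{Newt}_p^w(f_k))$. $\diamond$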
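
For a single univariate polynomial with integer coefficients, definition 2 reduces to the classical $p$-adic Newton polygon, and the lower edges can be computed directly. The Python sketch below is an illustration under that simplifying assumption ($n = 1$, nonzero integer coefficients); the function names are hypothetical. An edge of the lower hull with inner normal $(r, 1)$ accounts for roots $x$ with $\mathrm{ord}_p x = r$, and the horizontal length of the edge counts them with multiplicity.

```python
from fractions import Fraction


def ord_p(x, p):
    """p-adic valuation of a nonzero integer x."""
    x, k = abs(x), 0
    while x % p == 0:
        x //= p
        k += 1
    return k


def lower_hull(points):
    """Vertices of the lower convex hull, left to right (monotone chain)."""
    hull = []
    for px, py in sorted(points):
        while len(hull) >= 2:
            (x1, y1), (x2, y2) = hull[-2], hull[-1]
            # Drop the last vertex if it lies on or above the new segment.
            if (x2 - x1) * (py - y1) - (y2 - y1) * (px - x1) <= 0:
                hull.pop()
            else:
                break
        hull.append((px, py))
    return hull


def root_valuations(terms, p):
    """Map each valuation r to the number of nonzero roots with ord_p = r.

    `terms` maps integer exponents to nonzero integer coefficients.
    """
    points = [(a, ord_p(c, p)) for a, c in terms.items()]
    hull = lower_hull(points)
    counts = {}
    for (x1, y1), (x2, y2) in zip(hull, hull[1:]):
        r = Fraction(y1 - y2, x2 - x1)  # inner normal of this edge is (r, 1)
        counts[r] = counts.get(r, 0) + (x2 - x1)
    return counts


# 2x^2 + x + 2 over C_2: one root of valuation 1 and one of valuation -1.
print(root_valuations({0: 2, 1: 1, 2: 2}, 2))
```

Only the lower edges enter the count, and the number of possible valuations is bounded by the number of such edges, hence by the number of monomial terms.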



Smirnov's Theorem [Smi97, thm. 3.4] For any $n \times n$ polynomial system $F$ over $\mathbb{C}_p$, the
number of isolated roots $(x_1, \ldots, x_n)$ of $F$ in $(\mathbb{C}_p^*)^n$ satisfying $\mathrm{ord}_p x_i = r_i$ for all $i$ (counting
multiplicities) is no more than $\mathcal{M}(\pi(\mathrm{Newt}_p^{\hat{r}}(F)))$, where $\hat{r} := (r_1, \ldots, r_n, 1)$, $\mathcal{M}(\cdot)$ denotes mixed
volume [BZ88] (normalized so that $\mathcal{M}(\mathrm{Conv}(\{0, e_1, \ldots, e_n\}), \ldots, \mathrm{Conv}(\{0, e_1, \ldots, e_n\})) = 1$),
and $e_i$ is the $i$-th standard basis vector of $\mathbb{R}^n$. ■



Remarks 2

0. Explicit examples of the preceding constructions are illustrated in [Roj01b].

1. The number of possible distinct valuation vectors for a root of an $n$-variate polynomial
system $F$ can thus be combinatorially bounded from above as a function depending solely
on $n$ and the number of monomial terms (cf. section 2).

2. The number of roots of $F$ in $(\mathbb{C}_p^*)^n$ with given valuation vector thus depends strongly on
the individual exponents of $F$, not just on the number of monomial terms.

3. It is thus only the lower^11 faces of the $p$-adic Newton polytopes that matter in counting
roots or valuation vectors. $\diamond$

We prove theorem 2 in section 4. However, let us first show how theorem 2 implies
theorem 1: We will begin by examining the local case in the next section, and then complete
our proof by deriving the global case from the local case in section 3.

^11 Those with positive $x_{n+1}$ coordinate for their inner normals...

2. The Local Case of Theorem 1

Here we will assume that $\mathcal{L}$ is any degree $d$ algebraic extension of $\mathbb{Q}_p$. The following lemma
will help us reduce to the case $k = n$.

Lemma 1. (See [Roj01b, lemma 1].) Following the notation of theorem 1, there is a matrix
$[a_{ij}] \in \mathbb{Z}^{n \times k}$ such that the zero set of $G := (a_{11} f_1 + \cdots + a_{1k} f_k, \ldots, a_{n1} f_1 + \cdots + a_{nk} f_k)$ in
$(\mathbb{C}_p^*)^n$ is the union of the zero set of $F$ in $(\mathbb{C}_p^*)^n$ and a finite (possibly empty) set of points. ■

Proof of the Local Case of Theorem 1: It is clear that there are no isolated roots
whatsoever if $k < n$, since the underlying algebraic set over $\mathbb{C}_p$ is positive-dimensional. So
we can assume $k \ge n$. In the event that $k > n$, lemma 1 then allows us to replace $F$ by a
new $n \times n$ polynomial system (with no new exponent vectors) which has at least as many
isolated roots as our original $F$. So we can assume $k = n$ and observe that root multiplicities
are preserved if lemma 1 was not used (i.e., if we already had $k = n$ in our input). Since an
$m$-sparse $n \times n$ polynomial system clearly has no geometrically isolated roots whatsoever
when $m \le n$, we can clearly assume that $m \ge n + 1$. (Indeed, upon dividing each equation by
a suitable monomial, $m \le n$ clearly implies that we can obtain $n$ linear equations in $\le n - 1$
non-constant monomial terms.)



The well-known classification of when mixed volumes vanish [BZ88] then yields that
$\mathcal{M}(\pi(\mathrm{Newt}_p^{\hat{r}}(F))) > 0 \Longrightarrow$ there are linearly independent vectors $v_1, \ldots, v_n$, with $v_i$ an edge
of $\mathrm{Newt}_p^{\hat{r}}(f_i)$ for all $i$. So let $\ell_i$ be the number of edges of $\mathrm{Newt}_p(f_i)$. If $n = 1$ then we clearly
have $\ell_i \le m - 1$ for all $i$, and this is a sharp bound for all $m$. If $n > 2$ then we have the
obvious bound of $\ell_i \le m(m-1)/2$ for all $i$, and it is not hard to generate examples showing
that this bound is sharp for all $m$ as well [Ede87, thm. 6.5, pg. 101]. If $n = 2$ then note that
the number of edges of $\mathrm{Newt}_p(f_i)$ is clearly not decreased if we triangulate the boundary of
$\mathrm{Newt}_p(f_i)$. Since each 2-face of the resulting complex is incident to exactly 2 edges, Euler's
relation [Ede87, thm. 6.8, pg. 103] then immediately implies that $\ell_i \le 2m - 2$ for all $i$. (This
bound is easily seen to be sharp for all $m \ge 4$.)

We thus obtain that there are no more than $\ell_1 \cdots \ell_n \le u(m, n)$ possible values for an
$r \in \mathbb{R}^n$ with $\hat{r} = (r, 1)$ and $\mathcal{M}(\pi(\mathrm{Newt}_p^{\hat{r}}(F))) > 0$. In particular, by Smirnov's Theorem, this
implies that the number of distinct values for the valuation vector $(\mathrm{ord}_p x_1, \ldots, \mathrm{ord}_p x_n)$,
where $(x_1, \ldots, x_n) \in (\mathbb{C}_p^*)^n$ is a root of $F$, is no more than $u(m, n)$. So let us temporarily fix
$(r_1, \ldots, r_n) := r$ and see how many roots of $F$ in $(\mathcal{L}^*)^n$ can have valuation vector $r$.

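Following the notation of theorem 1, let $R_p := \{x \in \mathbb{C}_p \mid |x|_p \le 1\}$ be the ring of
algebraic integers of $\mathbb{C}_p$, let $M_p := \{x \in \mathbb{C}_p \mid |x|_p < 1\}$ be the unique maximal ideal of $R_p$,
$\mathbb{F}_{\mathcal{L}} := (R_p \cap \mathcal{L})/(M_p \cap \mathcal{L})$, and let $\rho$ be any generator of the principal ideal $M_p \cap \mathcal{L}$ of $R_p \cap \mathcal{L}$.
Also let $e_{\mathcal{L}} := \max_{y \in \mathcal{L}^*} \{|\mathrm{ord}_p y|^{-1}\}$ and $q_{\mathcal{L}} := \#\mathbb{F}_{\mathcal{L}}$. (The last two quantities are
respectively known as the ramification degree and residue field cardinality of $\mathcal{L}$, and satisfy
$e_{\mathcal{L}}, \log_p q_{\mathcal{L}} \in \mathbb{N}$ and $e_{\mathcal{L}} \log_p q_{\mathcal{L}} = d$ [Kob84, ch. III].) Since $\mathrm{ord}_p \rho = 1/e_{\mathcal{L}}$, it is clear that $r$ a
valuation vector of a root of $F$ in $(\mathcal{L}^*)^n \Longrightarrow r \in (\mathbb{Z}/e_{\mathcal{L}})^n$.

Fixing a set $A_{\mathcal{L}} \subset R_p$ of representatives for $\mathbb{F}_{\mathcal{L}}$ (i.e., a set of $q_{\mathcal{L}}$ elements of $R_p \cap \mathcal{L}$, exactly
one of which lies in $M_p$, whose image mod $M_p \cap \mathcal{L}$ is $\mathbb{F}_{\mathcal{L}}$), we can then write any $x_i \in \mathcal{L}$
uniquely as $\sum_{j = e_{\mathcal{L}} r_i}^{+\infty} a_j^{(i)} \rho^j$ for some sequence of $a_j^{(i)} \in A_{\mathcal{L}}$ [Kob84, corollary, pg. 68]. Note in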
particular that ^(^j^e^r^ thus lies in Rp\Mp for any a^*^ eAc\Mp. 



Theorem 2 thus implies that the number of isolated roots $(x_1, \ldots, x_n)$ of $F$ in $(\mathbb{C}_p^*)^n$
satisfying $(\mathrm{ord}_p x_1, \ldots, \mathrm{ord}_p x_n) = r$ and $x_1/\rho^{e_{\mathcal{L}} r_1} \equiv \cdots \equiv x_n/\rho^{e_{\mathcal{L}} r_n} \equiv 1 \pmod{M_p}$ is no more
than $C_p(m, n, (1/e_{\mathcal{L}}, \ldots, 1/e_{\mathcal{L}}))$. Furthermore, since $M_p \cap \mathcal{L} \subset M_p$, we obtain the same
statement if we restrict to roots in $(\mathcal{L}^*)^n$ and use congruence mod $M_p \cap \mathcal{L}$ instead.
Since there are $q_{\mathcal{L}} - 1$ possibilities for each leading coefficient $a^{(i)}_{e_{\mathcal{L}} r_i}$, our last observation tells us that the
number of isolated roots $(x_1, \ldots, x_n)$ of $F$ in $(\mathcal{L}^*)^n$ satisfying $(\mathrm{ord}_p x_1, \ldots, \mathrm{ord}_p x_n) = r$ is
no more than $(q_{\mathcal{L}} - 1)^n C_p(m, n, (1/e_{\mathcal{L}}, \ldots, 1/e_{\mathcal{L}}))$. So the total number of isolated roots of
$F$ in $(\mathcal{L}^*)^n$ is no more than $u(m, n)(q_{\mathcal{L}} - 1)^n C_p(m, n, (1/e_{\mathcal{L}}, \ldots, 1/e_{\mathcal{L}}))$. Since $e_{\mathcal{L}} \le d$ and
$q_{\mathcal{L}} \le p^d$, an elementary calculation yields our desired bound. ■

A simple consequence of our last proof is that there is a natural injection from the set of
possible valuation vectors of an isolated root of $F$ to the set of lower facets^12 of a particular
polytope. In particular, we can define $\Sigma_p(F)$ to be $\mathrm{Newt}_p\left(\prod_{i=1}^{k} f_i\right)$ or the Minkowski sum
$\sum_{i=1}^{n} \mathrm{Newt}_p(f_i)$, according as $k > n$ or $k = n$, and immediately obtain the following corollary.



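Corollary 1 Following the notation above, we have

\[ B(\mathcal{L}, m, n) \le \mathcal{F}(F)(q_{\mathcal{L}} - 1)^n C_p(m, n, (1/e_{\mathcal{L}}, \ldots, 1/e_{\mathcal{L}})), \]

where $\mathcal{F}(F)$ is the number of lower facets of $\Sigma_p(F)$, and $q_{\mathcal{L}}$ and $e_{\mathcal{L}}$ are respectively the
residue field cardinality and ramification index of $\mathcal{L}$. ■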

Example 3 Returning to example 1, observe that $f_1$ has $\le 3$ monomial terms (so $\mathrm{Newt}_2(f_1)$
has $\le 3$ edges) and $f_2$ has $\le \mu$ monomial terms (so $\mathrm{Newt}_2(f_2)$ has $\le 2\mu$ edges (cf. our use of
Euler's formula in the proof of the local case of theorem 1)). So we in fact have $\mathcal{F}(F) \le 6\mu$
(for all $\mu \ge 4$) and $\mathcal{F}(F) \le 9$ (for $\mu = 3$). Corollary 1 then implies improved upper bounds of

\[ 304(\mu - 1)\mu \left( 1 + \log_2 \frac{\mu - 1}{0.693} \right) \text{ (for all } \mu \ge 4\text{) and } 2304 \text{ (for } \mu = 3\text{)} \]

^12 cf. part 3 of remark 2 of section 1.1. Recall that a facet of a $d$-dimensional polytope is simply a face of
dimension $d - 1$.

for the number of roots of $F$ in $(\mathbb{Q}_2^*)^2$. Also, our refined bound is smaller than the
aforementioned real analytic bound (cf. example 1 of section 1) for all $\mu \ge 17$, where the two
bounds begin to exceed 456800. $\diamond$



Example 4 It is entirely possible that the maximum number of roots in $(\mathcal{L}^*)^n$ of an
$m$-sparse $n \times n$ polynomial system over $\mathcal{L}$ is actually larger for $\mathcal{L} = \mathbb{Q}_2$ than for $\mathcal{L} = \mathbb{R}$, for
small $m$ and $n$. In particular, a univariate trinomial over $\mathbb{R}$ clearly has at most 4 nonzero
real roots. However, $3x_1^{10} + x_1^2 - 4$ has exactly 6 nonzero roots in $\mathbb{Q}_2$ and this is the maximum
possible number of roots in $\mathbb{Q}_2$ for univariate trinomials over $\mathbb{Q}_2$ [Len99b, prop. 9.2]. $\diamond$



3. The Global Case of Theorem 1

Let us start with a construction from [Len99b, sec. 8] for the univariate case: First, fix a
group homomorphism $\mathbb{Q} \longrightarrow \mathbb{C}_2^*$, written $r \mapsto 2^r$, with the property that $2^1 = 2$. To construct
$2^r$ for an arbitrary rational $r$, choose $2^{1/n!}$ inductively to be an $n$-th root of $2^{1/(n-1)!}$, and then
define $2^{a/n!}$ to be the $a$-th power of $2^{1/n!}$ for any $a \in \mathbb{Z}$. Clearly, $\mathrm{ord}_2(2^r) = r$ for each $r \in \mathbb{Q}$.
For $j, e \in \mathbb{N}$ we then define the subgroups $U_e$ and $T_j$ of $\mathbb{C}_2^*$ by $U_e := \{x \mid \mathrm{ord}_2(x - 1) \ge 1/e\}$
and $T_j := \{\zeta \mid \zeta^{2^j - 1} = 1\}$. Note that $U_e \subseteq U_{e'}$ if $e \le e'$, and $T_j \subseteq T_{j'}$ if $j$ divides $j'$.

What we now show is that in addition to having few roots of bounded degree over $\mathbb{Q}_2$,
$F$ has few roots in another surprisingly large piece of $(\mathbb{C}_2^*)^n$.

Lemma 2. Let $e, j, k \in \mathbb{N}$, and let $F$ be an $m$-sparse $n \times n$ polynomial system over $\mathbb{C}_2$. Then
$F$ has at most $\mathcal{F}(F)(2^j - 1)^n C_2(m, n, (1/e, \ldots, 1/e))$ roots in the subgroup $(2^{\mathbb{Q}} \cdot T_j \cdot U_e)^n$ of
$(\mathbb{C}_2^*)^n$, where $\mathcal{F}(F)$ is as defined in corollary 1 of section 2.
Proof: First note that the case $n = 1$, in slightly different notation, is exactly lemma
8.2 of [Len99b]. The proof there generalizes quite easily to our higher-dimensional setting.
Nevertheless, for the convenience of the reader, let us give a succinct but complete proof.

First note that by theorem 2, $F$ has no more than $C_2(m, n, (1/e, \ldots, 1/e))$ roots in $U_e^n$.
By the change of variables $(x_1, \ldots, x_n) \mapsto (\alpha_1 y_1, \ldots, \alpha_n y_n)$ we then easily obtain the same
upper bound for the number of roots of $F$ in any coset of $U_e^n$. Since $T_j^n$ clearly has order
$(2^j - 1)^n$, $F$ thus has no more than $(2^j - 1)^n C_2(m, n, (1/e, \ldots, 1/e))$ roots in any coset
$(2^{r_1} T_j U_e) \times \cdots \times (2^{r_n} T_j U_e)$. Since Smirnov's Theorem implies, via our proof of the local
case of theorem 1 (cf. section 2), that a root $x \in (\mathbb{C}_2^*)^n$ of $F$ can produce no more than $\mathcal{F}(F)$
possible distinct values for $(r_1, \ldots, r_n) := (\mathrm{ord}_2 x_1, \ldots, \mathrm{ord}_2 x_n)$, we are done. ■

To at last prove the global case of theorem 1, let us quote another useful result of Hendrik
W. Lenstra, Jr. Recall that $\lceil x \rceil$ is the least integer greater than $x$.

Lemma 3. [Len99b, lemma 8.3] Let $n \in \mathbb{N}$ and let $L$ be an extension of $\mathbb{Q}_2$ of degree $\le D$.
Then there is a j ^ {1, . . . , Z)} such that L* C2'^TjC/|-(^/j-|^. ■ 

Proof of the Number Field Case of Theorem 1:

Since $\mathbb{Q}$ naturally embeds in $\mathbb{Q}_2$, we can assume $\mathcal{L}$ is a subfield of $\mathbb{C}_2$ of finite degree over
$\mathbb{Q}_2$. Then every root of $F$ in $(\mathbb{C}_2^*)^n$ of degree $\le \delta$ over $\mathcal{L}$ lies in $(L'^*)^n$, where $L'$ is an
extension of $\mathbb{Q}_2$ of degree at most $D := d\delta$. So by lemma 3, any such root of $F$ also lies in

From lemma |2| it now follows that the number of roots of F of degree < 6 over £ is no 
more than Zf=iHF){2^ " 1)"C'2 (m,n, [jjyfjjD, JJJJJJd)) ■ Since 2^ - 1<2^ J^{F)< 
u{m, n) (cf. the proof of the local case of theorem [l| in section Q), and C2(m, n,{r, . . . , r)) 
is a decreasing function of r, we thus obtain by geometric series that A(C, 5, m, n) < 
2"-'^^+^ u{m, n)C2 (rn,n, {-;]7ji, ■ ■ ■ , ■^jTgi)) ■ So by theorem ^ and an elementary calculation
we are done. ■ 

By leaving the last sum in our proof above unsimplified, we immediately obtain the 
following improvement of theorem |2|. 

Corollary 2 We have 

1 1 



5, ™, „) < HF) E(2' - yn,, n. ^ ^^^^^^^^ 

where $\mathcal{F}(F)$ is as defined in corollary 1 of section 2.



4. Proving Theorem 2

We begin with a clever observation of Hendrik W. Lenstra, Jr. on binomial coefficients,
factorials, and least common multiples. Recall that $a \mid b$ means that $a$ and $b$ are integers with
$a$ dividing $b$, and that $\lfloor x \rfloor$ denotes the greatest integer $\le x$.

Definition 3 [Len99b, sec. 2] For any nonnegative integers $m$ and $t$ define $d_m(t)$ to be
the least common multiple of all integers that can be written as the product of at most $m$
pairwise distinct positive integers that are at most $t$ (and set $d_m(t) := 1$ if $m = 0$ or $t = 0$).
Finally, for any $a \in \mathbb{Z}$, let us define $\binom{a}{t} := \frac{a(a-1) \cdots (a-t+1)}{t!}$. $\diamond$



Lemma 4. [Len99b, sec. 2] Following the notation of definition 3, we have...

(a) $d_m(t) \mid t!$

(b) $m \ge t \Longrightarrow d_m(t) = t!$

(c) $0 \le i \le m \le t \Longrightarrow i! \mid d_m(t)$

(d) $t \ge 1 \Longrightarrow \mathrm{ord}_p d_m(t) \le m \lfloor \log_p t \rfloor$

Furthermore, if $A \subset \mathbb{Z}$ is any set of cardinality $m$, then there are rational numbers
$\gamma_0(A, t), \ldots, \gamma_{m-1}(A, t)$ such that:

1. the denominator of $\gamma_j(A, t)$ divides $d_{m-1}(t)/j!$ if $t \ge m$ and $\gamma_j(A, t) = \delta_{jt}$ otherwise.^13

2. $\sum_{j=0}^{m-1} \gamma_j(A, t) \binom{a}{j} = \binom{a}{t}$ for all $a \in A$.
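
The quantity $d_m(t)$ of definition 3 can be computed by brute force for small arguments, which makes parts (a) through (d) of lemma 4 easy to spot-check. The Python sketch below is illustrative: the names `d`, `ord_p`, and `floor_log` are hypothetical helpers, and the enumeration is exponential in $t$, so it is only meant for tiny inputs.

```python
from itertools import combinations
from math import factorial, gcd, prod


def d(m, t):
    """lcm of all products of at most m pairwise distinct integers in 1..t."""
    if m == 0 or t == 0:
        return 1
    result = 1
    for size in range(1, min(m, t) + 1):
        for subset in combinations(range(1, t + 1), size):
            value = prod(subset)
            result = result * value // gcd(result, value)  # lcm update
    return result


def ord_p(x, p):
    """p-adic valuation of a positive integer x."""
    k = 0
    while x % p == 0:
        x //= p
        k += 1
    return k


def floor_log(p, t):
    """Largest k with p**k <= t, for t >= 1 (integer arithmetic only)."""
    k = 0
    while p ** (k + 1) <= t:
        k += 1
    return k


# d_1(t) is lcm(1, ..., t); d_m(t) equals t! as soon as m >= t.
print(d(1, 4), d(2, 4), d(2, 6))
```

Part (d) is what keeps the denominators of the $\gamma_j(A, t)$ from being too divisible by $p$: the valuation of $d_m(t)$ grows only like $m \log_p t$, not like $t$.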



Our proof of theorem 2 will consist of a careful application of Smirnov's Theorem to the
"shifted" polynomial system $G(x_1, \ldots, x_n) := F(1 + x_1, \ldots, 1 + x_n)$. (So roots of $F$ close
to $(1, \ldots, 1)$ are simply translations of roots of $G$ close to $(0, \ldots, 0)$.) Since the $g_i$ can be
highly non-sparse, one might not expect Smirnov's Theorem to give bounds independent of
the degrees of the $f_i$ on the number of roots of $G$ close to $(0, \ldots, 0)$. However, lemma 4 and
lemmata 5 and 6 below, save the day.
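
To see how non-sparse the shifted system can be, here is a small univariate Python sketch. It uses the expansion $g(x) = f(1+x) = \sum_t b_t x^t$ with $b_t = \sum_{a} c_a \binom{a}{t}$, which is the one-variable case of the coefficient formula used later in this section. The function name and the dictionary encoding (nonnegative integer exponents) are assumptions of this example.

```python
from math import comb


def shift_by_one(terms):
    """Coefficient list [b_0, ..., b_D] of g(x) = f(1 + x).

    `terms` maps nonnegative integer exponents a to coefficients c_a of f.
    """
    degree = max(terms)
    # b_t = sum over the support of c_a * binomial(a, t); comb(a, t) is 0 for t > a.
    return [sum(c * comb(a, t) for a, c in terms.items())
            for t in range(degree + 1)]


# f(x) = x^5 - 1 has 2 terms, but f(1 + x) has 5 nonzero coefficients.
print(shift_by_one({5: 1, 0: -1}))
```

Every $b_t$ is a fixed integer combination of the $m$ coefficients $c_a$, which is the structure that lemma 4 exploits to control the valuations of the $b_t$.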



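Lemma 5. Let $c := \frac{e}{e-1}$ (so $c < 1.582$) and $t_1, r_1, \ldots, t_n, r_n > 0$. Then

\[ \sum_{i=1}^{n} \left( r_i t_i - (m-1) \log_p t_i \right) \le (m-1) \sum_{i=1}^{n} r_i \Longrightarrow \sum_{i=1}^{n} r_i t_i \le c(m-1) \left[ \sum_{i=1}^{n} r_i + \log_p\left( \frac{(m-1)^n}{r_1 \cdots r_n \log^n p} \right) \right]. \]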



Proof: Here we make multivariate extensions of some observations of Lenstra from [Len99b,
prop. 7.1]: First note that it is easily shown via basic calculus that $1 - \frac{\log x}{x}$ assumes its
minimum (over the positive reals), $1/c$, at $x = e$. So for all $x > 0$ we have $x \ge (\log x) + x/c$.
Letting $t, r > 0$, $w := \frac{m-1}{r \log p}$, and $x := t/w$, we then obtain

\[ rt = rwx \ge rw((\log x) + x/c) = rw(\log t) - rw(\log w) + rt/c = (m-1)(\log_p t) - (m-1) \log_p\left( \frac{m-1}{r \log p} \right) + rt/c. \]

Substituting $r = r_i$, $t = t_i$, and summing over $i$ then implies

\[ (\star) \quad \sum_{i=1}^{n} r_i t_i \ge (m-1) \sum_{i=1}^{n} \log_p t_i - (m-1) \log_p\left( \frac{(m-1)^n}{r_1 \cdots r_n \log^n p} \right) + \frac{1}{c} \sum_{i=1}^{n} r_i t_i. \]



^13 $\delta_{ij}$ denoting the Kronecker delta, which is 0 when $i \ne j$ and 1 when $i = j$.
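Now suppose that

\[ \sum_{i=1}^{n} r_i t_i > c(m-1) \left[ \sum_{i=1}^{n} r_i + \log_p\left( \frac{(m-1)^n}{r_1 \cdots r_n \log^n p} \right) \right]. \]

Substituting into the last sum of the right hand side of our inequality $(\star)$ then
tells us

\[ \sum_{i=1}^{n} r_i t_i > (m-1) \sum_{i=1}^{n} \log_p t_i - (m-1) \log_p\left( \frac{(m-1)^n}{r_1 \cdots r_n \log^n p} \right) + (m-1) \left[ \sum_{i=1}^{n} r_i + \log_p\left( \frac{(m-1)^n}{r_1 \cdots r_n \log^n p} \right) \right]. \]

So we obtain $\sum_{i=1}^{n} r_i t_i > (m-1) \left( \sum_{i=1}^{n} \log_p t_i \right) + (m-1) \left( \sum_{i=1}^{n} r_i \right)$, which can be rearranged
into

\[ (\star\star\star) \quad \sum_{i=1}^{n} \left( r_i t_i - (m-1) \log_p t_i \right) > (m-1) \sum_{i=1}^{n} r_i. \]

So we conclude simply by taking the contrapositive. ■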

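The following lemma is a simple consequence of the basic properties of polytopes, their
faces, and their mixed volumes [BZ88].

Lemma 6. Following the notation of section 1.1, let $G := (g_1, \ldots, g_n)$ be any $n \times n$ polynomial
system and let $r := (r_1, \ldots, r_n)$ be such that $r_i > 0$ for all $i$. Also let

\[ w(g_i, r) := \pi\left( \bigcup_{\substack{s := (s_1, \ldots, s_n, 1) \\ s_i \ge r_i \text{ for all } i}} \mathrm{Newt}_p^{s}(g_i) \right) \]

for all $i$. Then

\[ \sum_{\substack{s := (s_1, \ldots, s_n, 1) \\ s_i \ge r_i \text{ for all } i}} \mathcal{M}(\pi(\mathrm{Newt}_p^{s}(G))) \le \mathcal{M}(\mathrm{Conv}(w(g_1, r)), \ldots, \mathrm{Conv}(w(g_n, r))). \]

In particular, if $Q = \{(t_1, \ldots, t_n) \in \mathbb{R}^n \mid r_1 t_1 + \cdots + r_n t_n \le 1 \text{ and } t_j \ge 0 \text{ for all } j\}$, then
$\mathcal{M}(\underbrace{Q, \ldots, Q}_{n \text{ times}}) = 1/\prod_{i=1}^{n} r_i$. ■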

Proof of Theorem 2:

First note that just as in the proof of the local case of theorem 1 (cf. section 2) we have that
$k < n$ or $m \le n$ implies that there are no isolated roots whatsoever. So we can assume that
$m \ge n + 1$. Also, again like in the proof of the local case of theorem 1, we can safely assume
via lemma 1 that $k = n$ and observe that root multiplicities are preserved if we already had
$k = n$ in our original input. Furthermore, if $m_i$ is the number of monomial terms occurring in
$f_i$ for all $i$, then it is easily checked that $m_i \le 1$ for any $i$ implies that there are no isolated
roots at all. So we can also assume that $m_1, \ldots, m_n \ge 2$.

Let us now set $g_i(x_1, \ldots, x_n) := f_i(1 + x_1, \ldots, 1 + x_n)$ for all $i$ and $G := (g_1, \ldots, g_n)$. It
is then clear that the number of isolated roots of $F$ with $\mathrm{ord}_p(x_i - 1) \ge r_i$ for all $i$ is the
same as the number of isolated roots of $G$ with $\mathrm{ord}_p x_i \ge r_i$ for all $i$, and multiplicities are
preserved by this change of variables. Smirnov's Theorem tells us that the latter number
(counting multiplicities) is exactly^14

\[ \sum_{\substack{s := (s_1, \ldots, s_n, 1) \\ s_i \ge r_i \text{ for all } i}} \mathcal{M}(\pi(\mathrm{Newt}_p^{s}(G))). \]
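Now let us define the following scaled standard simplex:

\[ S(m, n, r) := \left\{ (t_1, \ldots, t_n) \in \mathbb{R}^n \;\middle|\; \sum_{i=1}^{n} r_i t_i \le c(m-1) \left[ \sum_{i=1}^{n} r_i + \log_p\left( \frac{(m-1)^n}{r_1 \cdots r_n \log^n p} \right) \right] \text{ and } t_j \ge 0 \text{ for all } j \right\}. \]

Note then that by lemma 6, $\mathcal{M}(S(m_1, n, r), \ldots, S(m_n, n, r))$ is exactly

\[ c^n \prod_{i=1}^{n} \left\{ \frac{(m_i - 1) \left[ r_1 + \cdots + r_n + \log_p\left( \frac{(m_i-1)^n}{r_1 \cdots r_n \log^n p} \right) \right]}{r_i} \right\}, \]

since mixed volume is multihomogeneous with respect to scalings [BZ88]. Since $S(m, n, r)$
is clearly always convex, and since $w(g_i, r)$ is a union of convex hulls of subsets of $\mathrm{Supp}(g_i)$,
we also have that $w(g_i, r) \cap \mathrm{Supp}(g_i) \subseteq S(m_i, n, r) \Longrightarrow \mathrm{Conv}(w(g_i, r)) \subseteq S(m_i, n, r)$ for all $i$.

Since mixed volume is monotonic with respect to containment [BZ88], lemma 6 then
clearly implies that...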


^14 Note that the sum over $s$ is actually infinite, but has only finitely many nonzero summands. This is
because any polytope has only finitely many inner facet normals with last coordinate 1, and it is only these
terms which can possibly be nonzero.
To prove theorem 2 we need only show that $w(g_i, r) \cap \mathrm{Supp}(g_i) \subseteq S(m_i, n, r)$ for all $i$.

To do this, we will first prove that the valuations of the coefficients of any $g_i$ satisfy a "slow
decay" condition, and then use convexity of the gently sloping lower faces of the $p$-adic
Newton polytopes $\mathrm{Newt}_p(g_i)$ to prove that $w(g_i, r) \cap \mathrm{Supp}(g_i) \subseteq S(m_i, n, r)$ for all $i$.

Let us temporarily abuse notation slightly to avoid a profusion of indices and respectively
write $f$, $g$, and $m$ in place of $f_i$, $g_i$, and $m_i$ (for some arbitrary fixed $i$). Letting $D_i := \deg_{x_i} f$,
it is clear that we can write $g(x) := \sum_{j \in \prod_{i=1}^{n} \{0, \ldots, D_i\}} b_j x^j$, where $b_j := \sum_{a \in A} c_a \prod_{i=1}^{n} \binom{a_i}{j_i}$,
$f(x) = \sum_{a = (a_1, \ldots, a_n) \in A} c_a x^a$ (with every $c_a$ nonzero), $j = (j_1, \ldots, j_n)$, and $A := \mathrm{Supp}(f)$. Since
$f \ne 0$ we have $g \ne 0$ and thus not all the $b_j$ vanish. Note also that $D_i = 0 \Longrightarrow \mathrm{Supp}(g) \subseteq \{x \in
\mathbb{R}^n \mid x_i = 0\}$. Letting $\pi_i : \mathbb{R}^n \longrightarrow \mathbb{R}^{n-1}$ denote the natural orthogonal projection forgetting
the $i$-th coordinate, it is then clear that

\[ \pi_i\left( S(m, n, (r_1, \ldots, r_n)) \cap \{x \in \mathbb{R}^n \mid x_i = 0\} \right) \supseteq S(m, n-1, (r_1, \ldots, r_{i-1}, r_{i+1}, \ldots, r_n)), \]

and thus $D_i = 0$ implies that we can reduce to a case where $n$ is smaller. So we can assume
henceforth that $D_1, \ldots, D_n > 0$.

By lemma 4 there are rational numbers $\{\gamma_j^{(i)}(t_i)\}$, with $(i, j) \in \{1, \ldots, n\} \times \{0, \ldots, m-1\}$,
such that for all $a = (a_1, \ldots, a_n) \in A$ we have $\binom{a_i}{t_i} = \sum_{j=0}^{m-1} \gamma_j^{(i)}(t_i) \binom{a_i}{j}$, and the
denominators of the $\{\gamma_j^{(i)}(t_i)\}$ are not too divisible by $p$. (We will make the latter assertion precise
in a moment.)

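We thus obtain that for all $t := (t_1, \ldots, t_n) \in \prod_{i=1}^{n} \{0, \ldots, D_i\}$,

\[ b_t = \sum_{a \in A} c_a \prod_{i=1}^{n} \binom{a_i}{t_i} = \sum_{a \in A} c_a \prod_{i=1}^{n} \left( \sum_{j_i = 0}^{m-1} \gamma_{j_i}^{(i)}(t_i) \binom{a_i}{j_i} \right) = \sum_{a \in A} c_a \sum_{j \in \{0, \ldots, m-1\}^n} \prod_{i=1}^{n} \gamma_{j_i}^{(i)}(t_i) \binom{a_i}{j_i} \]

\[ = \sum_{j \in \{0, \ldots, m-1\}^n} \left( \prod_{i=1}^{n} \gamma_{j_i}^{(i)}(t_i) \right) \sum_{a \in A} c_a \prod_{i=1}^{n} \binom{a_i}{j_i} = \sum_{j \in \{0, \ldots, m-1\}^n} \left( \prod_{i=1}^{n} \gamma_{j_i}^{(i)}(t_i) \right) b_j. \]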

So the coefficients $\{b_t\}_{t \in \prod_{i=1}^{n} \{0, \ldots, D_i\}}$ of $g$ are completely determined by a smaller set of
coefficients corresponding to the exponents of $g$ lying in $\{0, \ldots, m-1\}^n$. Even better, lemma 4
tells us that $t_i \le m - 1 \Longrightarrow \gamma_{j_i}^{(i)}(t_i) = 0$ for all $j_i \ne t_i$. So we in fact have that

($\heartsuit$) $t_i \le m - 1 \Longrightarrow$ the recursive sum for $b_t$ has no terms corresponding to any $j$ with $j_i \ne t_i$.

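Given this refined recursion for $b_t$ we can then derive that the $p$-adic valuations of the $b_t$
decrease slowly and in a highly controlled manner: First note that our recursion, combined
with $(\heartsuit)$ and the ultrametric inequality, implies that

$$(\star)\qquad \mathrm{ord}_p(b_t) \geq \min_{j \in M_t}\left\{\mathrm{ord}_p(b_j) + \sum_{i=1}^n \mathrm{ord}_p\,\gamma^{(i)}_{j_i}(t_i)\right\} \quad \text{for all } t \in \prod_{i=1}^n \{0,\ldots,D_i\},$$

where $M_t$ is the subset of $\{0,\ldots,m-1\}^n$ obtained by the intersection, over all $i$ with
$t_i \leq m-1$, of the hyperplanes $\{j \in \mathbb{R}^n \mid j_i = t_i\}$. Then, by the definition of a face with inner
normal $(s,1)$, we have $(t,\mathrm{ord}_p b_t) \in \mathrm{Newt}_p^{(s,1)}(g) \Longrightarrow \left(\sum_{i=1}^n s_i t_i\right) + \mathrm{ord}_p b_t \leq \left(\sum_{i=1}^n s_i j_i\right) + \mathrm{ord}_p b_j$
for all $j \in \prod_{i=1}^n \{0,\ldots,D_i\}$. So for all such $j$ we must have $\mathrm{ord}_p b_j \geq \mathrm{ord}_p b_t + \sum_{i=1}^n s_i(t_i - j_i)$.
In particular, we obtain that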



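$$(\star\star)\qquad \left[(t,\mathrm{ord}_p b_t) \in \mathrm{Newt}_p^{(s,1)}(g) \text{ and } t_i \geq j_i \text{ and } s_i \geq r_i \text{ for all } i\right] \Longrightarrow \mathrm{ord}_p b_j \geq \mathrm{ord}_p b_t + \sum_{i=1}^n r_i(t_i - j_i).$$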



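Since $t \in \mathrm{Supp}(g)$ and $(t,\mathrm{ord}_p b_t) \in \mathrm{Newt}_p^{(s,1)}(g)$ imply that $\mathrm{ord}_p b_t < \infty$, we can thus
combine $(\star)$ and $(\star\star)$ to obtain that

$$t \in w(g,r) \cap \mathrm{Supp}(g) \Longrightarrow \mathrm{ord}_p b_t \geq \min_{j \in M_t}\left\{\mathrm{ord}_p(b_t) + \sum_{i=1}^n \left(r_i(t_i - j_i) + \mathrm{ord}_p\,\gamma^{(i)}_{j_i}(t_i)\right)\right\}.$$

Cancelling and rearranging terms, we thus obtain that $t \in w(g,r) \cap \mathrm{Supp}(g) \Longrightarrow$

$$\sum_{i=1}^n r_i t_i \leq \max_{j \in M_t}\left\{\sum_{i=1}^n \left(j_i r_i - \mathrm{ord}_p\,\gamma^{(i)}_{j_i}(t_i)\right)\right\} \leq \max_{j \in \{0,\ldots,m-1\}^n}\left\{\sum_{i=1}^n \left(j_i r_i - \mathrm{ord}_p\,\gamma^{(i)}_{j_i}(t_i)\right)\right\}.$$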

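Since lemma [?] tells us that $-\mathrm{ord}_p\,\gamma^{(i)}_{j_i}(t_i) \leq (m-1)(\log_p t_i) - \mathrm{ord}_p(j_i!)$ for all $i$, we then
obtain

$$(*)\qquad \sum_{i=1}^n \left(r_i t_i - (m-1)\log_p t_i\right) \leq \max_{j \in \{0,\ldots,m-1\}^n}\left\{\sum_{i=1}^n \left(j_i r_i - \mathrm{ord}_p(j_i!)\right)\right\} \leq (m-1)\sum_{i=1}^n r_i.$$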

So by lemma 5 we obtain that $w(g,r) \cap \mathrm{Supp}(g) \subseteq S(m,n,r)$, and thus $w(g_i,r) \cap \mathrm{Supp}(g_i) \subseteq
S(m_i,n,r)$ for all $i$. ■



5. Connections to Complexity Theory 

Thanks to our results, we now know in particular that the maximum number of isolated 
rational roots of a $k \times n$ polynomial system over $\mathbb{Q}$ depends polynomially on the number
of distinct exponent vectors, for fixed n. Here we note that it would be of considerable 
interest to know if this polynomiality persists relative to even more efficient encodings of 
polynomials. 

In particular, instead of monomial expansions (a.k.a. the sparse encoding), consider the
straight-line program (SLP) encoding for a univariate polynomial [BCSS98, sec. 7.1]:
That is, suppose we have $p \in \mathbb{Z}[x_1]$ expressed as a sequence of the form $(1, x_1, q_2, \ldots, q_N)$,
where $q_N = p$ and for all $i \geq 2$ we have that $q_i$ is a sum, difference, or product of some pair
of elements $(q_j, q_k)$ with $j,k < i$. Let $\tau(p)$ denote the smallest possible value of $N-1$, i.e.,
the smallest length, for such a computation of $p$. Clearly, $\tau(p)$ is no more than the number
of monomial terms of $p$, and is often dramatically smaller.
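The SLP encoding can be illustrated by a small evaluator. The following is a minimal sketch under our own conventions: a program is a list of triples `(op, j, k)` indexing into the sequence $(1, x_1, q_2, \ldots)$, and the names `evaluate_slp` and `shifted_power_slp` are hypothetical. The example computes $(1+x_1)^{2^k}$, which has $2^k+1$ monomial terms, in only $k+1$ steps.

```python
def evaluate_slp(steps, x):
    """Evaluate a straight-line program at the integer x.

    The value sequence starts as [1, x]; each step (op, j, k) appends
    values[j] op values[k], where op is '+', '-' or '*' and j, k index
    earlier entries. The last entry is the value of p at x.
    """
    values = [1, x]
    for op, j, k in steps:
        if op == "+":
            values.append(values[j] + values[k])
        elif op == "-":
            values.append(values[j] - values[k])
        elif op == "*":
            values.append(values[j] * values[k])
        else:
            raise ValueError(f"unknown operation: {op!r}")
    return values[-1]


def shifted_power_slp(k):
    """SLP of length k + 1 for (1 + x)^(2^k): one addition, then k squarings."""
    steps = [("+", 1, 0)]          # q_2 = x + 1
    last = 2                       # index of the most recent entry
    for _ in range(k):
        steps.append(("*", last, last))
        last += 1
    return steps


program = shifted_power_slp(5)     # computes (1 + x)^32 in 6 steps
value_at_two = evaluate_slp(program, 2)
```

Here the length of `program` gives an upper bound of 6 on the smallest length for $(1+x_1)^{32}$, whereas the monomial expansion has 33 terms.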
Theorem 3 [BCSS98, thm. 3, pg. 127] Suppose there is an absolute constant $\kappa$ such that
for all nonzero $p \in \mathbb{Z}[x_1]$, the number of distinct roots of $p$ in $\mathbb{Z}$ is no more than $(\tau(p)+1)^\kappa$.
Then $\mathbf{P}_{\mathbb{C}} \neq \mathbf{NP}_{\mathbb{C}}$. ■



In other words, an analogue (regarding complexity theory over $\mathbb{C}$) of the famous unsolved
$\mathbf{P} \stackrel{?}{=} \mathbf{NP}$ question from computer science (regarding complexity theory over the ring $\mathbb{Z}/2\mathbb{Z}$)
would be settled. The question of whether $\mathbf{P}_{\mathbb{C}} = \mathbf{NP}_{\mathbb{C}}$ remains open as well but it is known
that $\mathbf{P}_{\mathbb{C}} = \mathbf{NP}_{\mathbb{C}} \Longrightarrow \mathbf{NP} \subseteq \mathbf{BPP}$. (This observation is due to Steve Smale and was first
published in [Shu93].) The complexity class $\mathbf{BPP}$ is central in randomized complexity and
the last inclusion (while widely disbelieved) is also an open question. The truth of the
hypothesis of theorem 3, also known as the $\tau$-conjecture, is yet another open problem, even
for $\kappa = 1$.

One can reasonably suspect that a sufficiently good upper bound for the number of
integral roots of an $m$-sparse $k \times n$ polynomial system could be applied to settling the
$\tau$-conjecture:^3 Indeed, any computation of $p$ of length $\tau(p)$ can be specialized to obtain a
computation of the same length for $p(k)$ for any integer $k$. Finding an integral root of $p$
can then be reinterpreted as finding all possible values for the first coordinate of an integral
root of the $2\tau(p)$-sparse $\tau(p) \times (\tau(p)+1)$ polynomial system defined by the corresponding
computational sequence for $p$. However, the number of variables grows linearly with $\tau(p)$,
so this route toward an application of theorem [?] would at best give us an upper bound
exponential in $\tau(p)$. For better or worse, we thus arrive at a Diophantine problem currently
out of our grasp: finding sharp bounds on the number of integral points on certain algebraic
sets defined by quadratic binomials and linear trinomials.

^3 i.e., Diophantine results for multivariate polynomial systems in the sparse encoding can be useful for
Diophantine problems involving univariate polynomials in the SLP encoding.

A reasonable alternative approach would be to use the embedding of $\mathbb{Q}$ in another complete
field, $\mathbb{R}$ in particular. Over $\mathbb{R}$ there are results for univariate polynomials involving
an even sharper encoding: For any $p \in \mathbb{R}[x_1]$, let its additive complexity, $\sigma(p)$, be the minimal
number of additions and subtractions necessary to express $p$ as an elementary algebraic
expression with constant exponents, e.g., $p(x) = (1 - (x+2)^{[?]})^{[?]} + 243(x-7)^{[?]}$ has
$\sigma(p) \leq 4$, and it is clear that $\tau(p) \geq 4$. More generally, it is easily checked that $\sigma(p) \leq \tau(p)$
for all $p \in \mathbb{Z}[x_1]$. Remarkably, one can bound the number of real roots of $p$ solely in terms of
$\sigma(p)$. This has been known since the work of Allan Borodin and Stephen A. Cook around 1974
[BC76], and the best current upper bound is Jean-Jacques Risler's $C^{\sigma(p)^2}$, for some absolute
constant $C \in (1,32)$ [Gri82, Ris85]. Unfortunately, there are examples of $p \in \mathbb{Z}[x_1]$ with
$\sigma(p) = O(r)$ and at least $2^r$ real roots (all of which are irrational) [Roj00b, sec. 3, pg. 13].
So additive complexity is too efficient an encoding to be useful in settling the $\tau$-conjecture,
at least over $\mathbb{R}$.

Whether analogous (hopefully polynomial) bounds in terms of a sharper encoding exist 
in our arithmetic setting is an open question, even for $n = 1$. In particular, it is interesting
to note that the only obstructions to refining theorem [?] to a sharper encoding are (a) the
strong dependence of the quantity J-'{F), arising from our application of Smirnov's Theorem, 
on the number of monomial terms, and (b) the existence of an analogue of theorem [?] for a
sharper encoding. 

As for actually finding all the isolated rational roots of $F$, there is both good news and
bad news: The bad news is that one cannot have a polynomial time algorithm (relative to
the sparse encoding) for $n > 1$. The good news is that there is a polynomial time algorithm
(relative to the sparse encoding) for $n = 1$, and that the counter-examples for $n > 1$ are very
simple.

In particular, if we take $\mathcal{L} = \mathbb{Q}$ and measure the input size simply as the number of digits
needed to write the coefficients and exponents of $F$ in, say, binary, then it is possible for an
isolated rational root of $F$ to have bit size^4 exponential in the bit size of $F$: Simply consider
$k = n = 2$, $m = 4$, and $F := (x_1 - x_2^D, x_2 - 2)$. This particular example clearly has bit size
$\Theta(\log D)$ but its one rational root $(2^D, 2)$ has a first coordinate of bit size $D$, which is exponential
in the bit size of $F$. Thus one can't even write the output in polynomial time relative to
the sparse encoding. Similar examples with bit size $O(n \log D)$ and having a single rational
root, but with root coordinates of bit size $\Omega(D^n)$, are easy to construct for all $n \geq 3$ via the
same recursive idea [Roj00b, pg. 16, complication Q2]. With a bit more work one can even
show that such roots of "excessively large" bit size occur not only in a worst case sense but
also in an average case sense.

On the other hand, it is a fortunate accident that the absolute logarithmic height of
a complex root of $F$ of degree $\leq \delta$ over $\mathcal{L}$ (and thus equivalently, the bit size of such a root)
is polynomial in the bit size of $F$ for $n = 1$ and $\mathcal{L}$ a number field [Len99a, prop. 2.3]. This
is what permits a clever polynomial time algorithm for solving $F$ when $n = 1$ and $\mathcal{L}$ and $\delta$
are fixed [Len99a, first theorem].^5 For $n > 1$ it thus appears that the only way to achieve a
polynomial time algorithm would be to allow a more efficient encoding of the output than
expanding into digits. In particular, it is an open question, even for $n = 2$, whether one can
always find SLP's, of length polynomial in the bit size of $F$, for the isolated rational roots
of $F$.

^4 The bit size of an integer is thus implicitly the number of digits in its binary expansion, and the bit
size of a rational number can be taken as the maximum of the bit sizes of its numerator and denominator
(written in lowest terms).

Alternatively, one can simplify the question of solving and simply ask how many isolated 
rational roots F has, or whether F has any isolated rational roots at all. This is addressed 



in [Roj01a, thms. 1.3 and 1.4], where it is shown that the truth of the Generalized Riemann
Hypothesis implies that detecting a strong form of non-solvability over the rationals (transitivity
of the underlying Galois group) can be done within the complexity class $\mathbf{P}^{\mathbf{NP}^{\mathbf{NP}}}$,
provided the underlying complex zero set is finite. In the latter result, $n$ is allowed to be
part of the input and can thus vary.